An overview of topic modeling and its current applications in bioinformatics
Abstract
BACKGROUND: With the rapid accumulation of biological datasets, machine learning methods that automate data analysis are urgently needed. In recent years, topic models, which originated in natural language processing, have received much attention in bioinformatics because of their interpretability. Our aim was to review the application and development of topic models for bioinformatics.
DESCRIPTION: This paper begins by describing topic models, with a focus on building an understanding of topic modeling. It gives a general outline of how to build an application with a topic model and how to develop a topic model. The literature on the application of topic models to biological data was also searched and analyzed in depth. We categorized the related studies according to the type of model, the analogy between the document-topic-word concept and a biological object, and the task the topic model performs, and we provide an outlook on the use of topic models for developing bioinformatics applications.
CONCLUSION: Topic modeling is a useful method, in contrast to the traditional means of data reduction in bioinformatics, and it enhances researchers' ability to interpret biological information. Nevertheless, because topic models optimized for specific biological data are lacking, research on topic modeling of biological data still has a long and challenging road ahead. We believe that topic models are a promising method for various applications in bioinformatics research.
Journal title:
Volume 5, issue
Pages -
Publication date: 2016